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We report here the draft genome sequences of nine enteropathogenic Escherichia coli (EPEC) strains isolated from children in 
Kenya who died during hospitalization with diarrhea. Each of the isolates possess the EPEC adherence factor (EAF) plasmid en- 
coding the bundle-forming pilus, which is characteristic of EPEC. These isolates represent diverse serogroups and EPEC phylog- 
enomic lineages. 
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Attaching and effacing Escherichia coli (AEEC) is characterized 
by the presence of the locus of enterocyte effacement (LEE) 
pathogenicity region that encodes a type III secretion system in- 
volved in pathogenesis (1-3). AEEC is further identified as either 
LEE-positive Shiga toxin-producing E. coli (STEC) or entero- 
pathogenic E. coli (EPEC). EPEC strains contain the LEE region 
but not the Shiga toxin-encoding phage, and they are considered 
typical EPEC when they possess the bundle-forming pilus (BFP) 
encoded by the EPEC adherence factor (EAF) plasmid (1,4, 5). In 
contrast, LEE-positive sfx-negative fo/p-negative E. coli strains are 
considered atypical EPEC (4). The nine E. coli isolates de- 
scribed here are LEE positive, stx negative, and bfp positive and 
are therefore classified as typical EPEC based on their virulence 
factor content. 

The genomic DNA for sequencing was isolated from an over- 
night culture of each isolate using the Sigma GenElute kit (Sigma- 
Aldrich). Genome sequencing was performed at the University of 
Maryland School of Medicine, Institute for Genome Sciences, at 
the Genome Resource Center. The genome sequences were 
generated using paired-end libraries with 300-bp inserts on the 
lUumina HiSeq 2000. The lUumina reads generated for each 
isolate were assembled de novo using the Velvet assembly pro- 
gram (6), with fc-mer values determined using VelvetOptimiser 
version 2. 1 .4 (http://bioinformatics.net.au/software.velvetoptimiser. 
shtml). The genome assemblies contained an average of 181 con- 
tigs per isolate (range, 117 to 283), with an average G+C content 
of 50.4% (range, 50.3 to 50.6%) and an average genome size of 
5.2 Mb (range, 5.0 to 5.4 Mb). 

These nine isolates were isolated from children in Kenya who 
died during hospitalization with diarrhea (7). These EPEC isolates 
have diverse serogroups (0126, 0119, Olll, 053, 0127, 055, 
and Ol 14). Phylogenomic analysis of the whole genomes demon- 
strated that these isolates occur in two E. coli phylogroups (B 1 and 
B2) (8, 9) and occupy four EPEC phylogenomic lineages. Among 



the four EPEC isolates in phylogroup B2 was isolate S6995 (0127), 
which was identified in the EPEC 1 phylogenomic lineage, with the 
most widely studied prototype EPEC isolate E2348/69 (10). Also 
from phylogroup B2 was EPEC isolate S6400 (0119), which was 
identified in the recently described EPEC4 phylogenomic lineage 
(11). The remaining two EPEC isolates identified in phylogroup 
B2, S6966 (053) and S7438 (055), were present in unassigned 
phylogenomic lineages. The five EPEC isolates identified in phy- 
logroup Bl, S7005 (0126), S6685 (0114), S5274 (Olll), S7380 
(serogroup unknown), and S6662 (Olll), were all present in the 
EPEC2 phylogenomic lineage, which contains the EPEC proto- 
type isolate B171 (12). 

The genome sequencing of these nine EPEC isolates demon- 
strates that the EPEC isolates associated with diarrheal illness of 
children in Kenya exhibit considerable genomic diversity that was 
not previously appreciated. 

Nucleotide sequence accession numbers. The genome as- 
semblies have been deposited at DDBJ/EMBL/GenBank with 
accession no. JICIOOOOOOOO, JICHOOOOOOOO, JICGOOOOOOOO, 
JICFOOOOOOOO, JICEOOOOOOOO, JICDOOOOOOOO, JICGOOOOOOOO, 
JICBOOOOOOOO, and JICAOOOOOOOO, for isolates S6966, S6995, 
S7005, S7438, S6400, S7380, S6662, S5274, and S6685, respec- 
tively. 
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